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Abstract 

Background: Eukaryotic DNA replication follows a specific temporal program, with some genomic regions 
consistently replicating earlier than others, yet what determines this program is largely unknown. Highly transcribed 
regions have been observed to replicate in early S-phase in all plant and animal species studied to date, but this 
relationship is thought to be absent from both budding yeast and fission yeast. No association between cell-cycle 
regulated transcription and replication timing has been reported for any species. 

Results: Here I show that in budding yeast, fission yeast, and human, the genes most highly transcribed during 
S-phase replicate early, whereas those repressed in S-phase replicate late. Transcription during other cell-cycle 
phases shows either the opposite correlation with replication timing, or no relation. The relationship is strongest 
near late-firing origins of replication, which is not consistent with a previously proposed model — that replication 
timing may affect transcription — and instead suggests a potential mechanism involving the recruitment of limiting 
replication initiation factors during S-phase. 

Conclusions: These results suggest that S-phase transcription may be an important determinant of DNA replication 
timing across eukaryotes, which may explain the well-established association between transcription and replication 
timing. 



Background 

The timing of DNA replication during S-phase of the 
cell cycle plays an important role in genome integrity, 
the mutational spectrum, and a wide range of human 
diseases [1]. Despite many recent advances in our ability 
to measure the time of replication (T rep ) across entire 
genomes [2-7], our understanding of what regulates this 
timing remains far from complete [1,8-11]. The time at 
which origins of replication (ORIs) fire is thought to be 
determined in M-phase [12] or Gl [13,14], at which 
point factors such as Cdc45 and Sld3 bind to ORIs that 
will fire early in the following S-phase [15,16]. These 
and several other proteins critical for replication initi- 
ation are present at copy-numbers lower than the num- 
ber of ORIs [17-19], and their over-expression advances 
T rep for many late-firing ORIs in both budding and fis- 
sion yeast [12,17-20], suggesting that their re-use may be 
a key step in regulating ORI firing time. However, what 
determines the relative affinities of different ORIs for 
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these limiting factors - and hence their temporal order 
of initiation - is largely unknown [19]. 

Among the strongest correlates (and potential deter- 
minants) of T rep in metazoans are transcriptional activity 
and chromatin state. Although transcriptionally active 
euchromatin has been known to replicate earlier than 
repressive heterochromatin for over 50 years [11,21], the 
reason - and even the direction of causation - has 
remained elusive. The two major models [8,11], not mu- 
tually exclusive, are that 1) the euchromatic chromatin 
structure is more permissive both to transcription and 
to DNA replication initiation, or 2) T rep itself affects 
chromatin structure and transcription as a result of 
changes in the nuclear milieu during S-phase. The 
former is most directly supported by experiments alter- 
ing ORI firing time via manipulation of histone modifi- 
cations [8-10,18,22-24], whereas the latter is supported 
by differences in chromatin and transcription of DNA 
templates injected into cells during either early or late 
S-phase [8,9,25,26]. 

Measuring T rep genome-wide in the budding yeast 
Saccharomyces cerevisiae (Sc), Raghuraman et al. [2] 
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reported a surprising lack of association between tran- 
scription and T rep (with the exception of the eight histone 
genes, which are highly transcribed in S phase and are 
replicated early). However, this analysis only involved clus- 
ters of co-expressed genes, and did not actually compare 
the highest- versus lowest- expressed genes. Nevertheless, 
it has been widely interpreted in the literature as indicat- 
ing the absence of any association, and many authors have 
speculated as to why budding yeast lacks this relationship 
[5,8-11]. Similarly, the fission yeast Schizosaccharomyces 
pombe {Sp) is thought to lack any association between 
transcription and replication timing [8], though again no 
systematic comparison has been reported. 

Results 

Because DNA replication is confined to a specific period 
during the cell cycle, I reasoned that the relationship be- 
tween T rep and transcription may depend on when in the 
cell cycle transcription is occurring. The transcription of 
most genes does not vary greatly throughout the cell 
cycle, so cannot be used to determine phase-dependent 
effects. However, several hundred genes have been iden- 
tified in both Sc and Sp for which transcription does vary 
consistently during the cell cycle [27,28]. I compared the 
expression levels of these cell cycle-regulated genes mea- 
sured in synchronized cells [27,28] with the T rep for each 
gene to determine if any relationship exists. For both Sc 
and Sp expression levels measured in G2 phase, higher 
expression associated with earlier T rep (Figure 1A). How- 
ever, at other points in the cell cycle the relationship was 
quite different; mostly notably in M/Gl {Sc) or Gl {Sp), 
the relationship reversed, such that highly expressed 
genes were replicated late (Figure 1A). 

To more systematically visualize these patterns, I cal- 
culated the correlation between the expression levels of 
all cell cycle-regulated genes measured in synchronized 
cultures [27,28] with their T rep , separately for each ex- 
pression data time-point (see Materials and methods). 
Plotting these correlation coefficients as a function of 
the time at which the expression data were sampled, I 
found a striking relationship: both the strength and dir- 
ection of the correlation oscillate as a function of cell- 
cycle stage (Figure IB). In these plots, positive r values 
represent time-points at which up-regulated genes tend 
to be replicated late in S phase; negative r values indicate 
times when up-regulated genes are replicated early. Con- 
sistent with the results in Figure 1A, in both species of 
yeast, genes highly expressed in G2 phase are replicated 
early, while those expressed in late M/Gl are replicated 
late. The oscillation is observed regardless of the method 
used to achieve cell-cycle synchronization (Additional 
file 1: Figures SI and S2). 

To further characterize this relationship, I plotted a 
moving average of T rep for the cell cycle-regulated genes 



in each species, ordered by their time of maximal ex- 
pression. If expression in certain cell-cycle phases corre- 
lates with early or late replication, this should be reflected 
by troughs or peaks in such a plot. Again in both species a 
similar trend emerged: T rep reaches a maximum for genes 
expressed in Gl, and a minimum for those expressed in 
G2 (Figure 1C; Additional file 1: Figure S3), consistent 
with the correlation analysis (Figure IB). The strong con- 
servation of this pattern was surprising, considering how 
much the regulation of DNA replication has diverged in 
the hundreds of millions of years separating these two 
yeast lineages [29] . 

Although the strongest association between high 
mRNA levels and early replication was observed for G2- 
phase expression levels, it is important to note that this 
does not imply these genes are maximally transcribed in 
G2. Rather, one would expect maximal transcription to 
occur in the time leading up to the maximal transcript 
level, that is, in S phase. Indeed, plotting mRNA levels for 
G2-upregulated genes (those with early T rep in Figure 1C), 
it is clear that their transcript levels show the greatest 
increase - likely reflecting active transcription - in S 
phase (Additional file 1: Figure S4A). Likewise, genes 
with late T rep show the opposite pattern: maximal de- 
crease in mRNA levels during S phase (Additional file 1: 
Figure S4B). 

The oscillating relationships shown in Figure 1 do not 
establish whether T rep is more directly associated with 
transcription in S phase or in M phase. For example, if 
M-phase repression led to early T rep , S -phase induction 
could be associated with early T rep simply as an indirect 
consequence, because genes repressed in M phase are 
typically induced in S phase (Additional file 1: Figure 
S4A). To disentangle the effects of S and M phases, I ex- 
amined the T rep of genes that are expressed at similar 
levels throughout the cell cycle. If M-phase repression 
leads to early T rep , then genes repressed throughout the 
cell cycle would be expected to have early T rep , as a re- 
sult of their repression in M phase (in this scenario, their 
S -phase expression levels are not relevant). However, if 
the association is instead due to S -phase induction, 
genes with constitutive high expression would have earl- 
ier T rep because of their active transcription in S phase 
(in which case M-phase expression levels would be ir- 
relevant). This analysis showed a clear trend: highly 
expressed genes replicate 5.9 minutes earlier in Sc 
and 3.0 minutes earlier in Sp (Figure 2). Therefore, the 
results shown in Figure 1 can be entirely, and most par- 
simoniously, explained by the association of T rep with 
S -phase transcription; the M-phase relationship is likely 
to be an indirect side effect of this. This result also 
suggests a more general association between transcrip- 
tion and T rep in yeast that extends beyond cell cycle- 
regulated genes. 



Fraser Genome Biology 201 3, 14:R1 1 1 
http://genomebiology.com/2013/l4/10/R1 1 1 



Page 3 of 9 



S. cerevisiae 



5. pombe 



G2 phase 



M/Gl phase 



G2 phase 



Gl phase 




Repressed Induced Repressed Induced 
genes genes genes genes 

(bottom decile) (top decile) (bottom decile) (top decile) 



Repressed Induced 
genes genes 
(bottom decile) (top decile) 



Repressed Induced 
genes genes 
(bottom decile) (top decile) 



B 



0.15 



O 0.1 
_ro 

cu 0.05 
O 

U c 0 

^ -0.05 
O 

£ -0.1 

cu 

x" -0.15 
-0.2 



— 1 


" if 






M/Gl | Gl | S | 


G2 | M | M/Gl | Gl | S | G2| M 



0.2 
J 0.15 

ID o.i 
8 0 05 
0 

g-0.05 

8 -0.1 
CU 

O--0.15 
-0.2 
-0.25 















G2 1 M 1 Gl 1 S 1 G2 1 M 1 Gl 1 S 1 G2 



14 21 28 35 42 49 56 63 70 77 84 9" 

Minutes after a-factor release 



15 30 45 60 75 90 105 120135150165180195 210 225 240 255 270 285 

Minutes after elutriation 




Moving average of cell-cycle regulated genes 
ordered by time of maximal expression 



Moving average of cell-cycle regulated genes 
ordered by time of maximal expression 



Figure 1 The transcription/T rep association varies by cell-cycle stage. (A) Comparing mean T rep of the top decile (10%) of most-induced versus 
most-repressed cell cycle-regulated genes reveals that genes highly expressed in G2 replicate early in both Sc and Sp, whereas those highly expressed 
in M/G1 (Sc) or G1 (Sp) replicate late (error bars represent the standard error). (B) The correlation between T rep and expression levels of known cell 
cycle-regulated genes was calculated separately for expression levels from each time point of cell cycle-synchronized time courses [27,28]. An 
oscillation of the correlation coefficient (Pearson's r) was observed for both budding yeast (all \r\ > 0.107 are significant at P < 0.0025) and fission yeast 
(all |r| > 0.1 77 are at P < 0.0025). The approximate cell-cycle phase of each time point is shown [27,28]. Similar oscillations are observed for other 
methods of synchronization as well (Additional file 1: Figures S1 and S2). (C) A moving average of T rep is shown for all cell cycle-regulated genes, 
arranged in order of their time of maximal expression, beginning immediately following mitosis. A similar pattern is observed for both yeast species, 
with the latest T rep for genes with maximal transcript levels in Gl, and the earliest T rep for genes with maximal transcript levels in G2. 



To further investigate the connection between S -phase 
transcription and T rep , I tested whether the relationship 
differed for genes replicated during early versus late S 
phase. In this analysis I separated all cell cycle-regulated 
genes into 10 bins (that is, deciles) by their T rep , and 
plotted the median G2-phase transcript level (the time- 
point most closely reflecting S -phase transcription; 
Additional file 1: Figure S4A) for each. Across all 10 T rep 
bins, I observed the expected relationship: decreasing 
expression of bins with increasing T rep (Figure 3A). 
However, closer examination revealed that, for both 



yeast species, this pattern was almost entirely driven by 
late-replicating genes. In other words, there was no cor- 
relation between expression levels and T rep for genes in 
the first five bins (replicated in early S phase), while in late 
S phase the relationship was quite strong (Figure 3A). 
Consistent with this, applying the correlation analysis 
from Figure 1A to just early- or late-replicating genes re- 
vealed that the oscillation is entirely driven by replication 
in late S phase; genes with early T rep showed no oscilla- 
tion, and only a weak correlation at nearly all time-points 
(Figure 3B). These results parallel the finding in mouse 
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Figure 2 Asynchronous gene expression associates with T rep in budding and fission yeast. Comparison of the 100 highest-expressed 
genes with the 100 lowest-expressed shows that highly expressed genes are replicated earlier in both budding yeast and fission yeast. Error bars 
represent the standard error. 



that genes replicated in the second half of S phase show 
the strongest association between transcript levels and 
T rep [71 

Another factor that may influence the relationship be- 
tween S -phase transcription and replication timing is a 
genes distance from the nearest ORL Under the model 
where chromatin affects both transcription and T rep , the 
strongest association would be expected for genes near 
ORIs, whereas if instead T rep affects a genes level of 
S -phase transcription, the relationship should be inde- 
pendent of distance to the nearest ORI [3]. Separating 
genes into two classes, ORI-proximal or ORI-distal, the 
ORI-proximal class showed far stronger oscillations 
(Figure 3C; ORI distance cutoffs, chosen to result in ap- 
proximately equal-sized lists, were 5 kb from the 
nearest ORI in Sc and 10 kb in Sp, due to the higher 
density of known ORIs in Sc; results from equal dis- 
tance cutoffs are shown in Additional file 1: Figure S5). 
Because ORI-proximal genes tend to be replicated earlier 
than ORI-distal genes, this result could not be an indirect 
effect of the stronger association for late T rep genes, as it 
acts in the opposite direction. This result suggests that the 
relationship is unlikely to be caused by an effect of T rep on 
S-phase transcription, which is one of the two major clas- 
ses of models that have been proposed to explain the tran- 
scription/ T rep association [8,25,26]. 

To test whether the relationship between S-phase 
transcription and replication timing is conserved outside 
of fungi, I applied the same correlation analysis to cell- 
cycle gene expression and T rep data from human HeLa 
cells [6,30]. Analyzing all known HeLa cell-cycle regu- 
lated genes [30], I found no significant relationship of 



any kind (Figure 4A). However, applying each of the two 
filters identified from yeast - late T rep and ORI proximity 
(within 10 kb) - resulted in clear and significant oscilla- 
tions, of a magnitude similar to that observed for both 
yeast species (Figure 4B,C). As observed for yeast, the 
minimum correlation (indicating early T rep of up-regulated 
genes) occurred in G2, and the maximum in late M/Gl. 
The fact that the same oscillating relationship exists in 
human, and that its strength is influenced by the same 
two factors, suggests that it is likely to be caused by a 
mechanism conserved between fungi and metazoans. 

To put into perspective the strength of the relationship 
between T rep and cell cycle-regulated gene expression in 
human, I compared it to the well-established association 
between T rep and average (asynchronous) gene expres- 
sion. The latter provides a useful benchmark because it 
is regarded as a strong relationship that has been ob- 
served in numerous studies across diverse metazoans 
[5,6,8,9]. To facilitate a direct comparison with the re- 
sults in Figure 4, I used the same T rep data [6] for the 
same genes, but replaced the cell cycle-synchronized 
gene expression data [30] with high-coverage RNA-seq 
data from asynchronous HeLa cells [31]. The correlation 
between asynchronous expression and T rep was r= -0.16 
for late T rep genes (the genes represented by the red line 
in Figure 4B) and r= -0.15 for ORI-proximal genes (rep- 
resented by the blue line in Figure 4C). In both cases, 
the asynchronous data explained less than a third of the 
variance in T rep that is explained by S-phase transcrip- 
tion (see Materials and methods). Differing quality of 
the two gene expression data sets [30,31] could contrib- 
ute to this difference; however, because RNA-seq is of 
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Figure 3 Factors affecting the strength of the transcription/T rep association. (A) Median G2-phase transcript levels (representing S-phase 
transcription; Additional file 1: Figure S4A) are shown for all cell cycle-regulated genes separated into 10 equally sized bins (deciles) by their T rep . 
For both yeast species, no correlation is observed for the first five bins, whereas a strong relationship is present for later T rep . (B) Consistent with 
the decile analysis, no oscillation is observed in the correlation between expression level and T rep for early T rep genes, while a strong oscillation is 
observed for late T rep genes. (C) Only weak oscillation is observed in the correlation between expression level and T rep for ORI-distal genes (>5 kb 
from the nearest ORI in budding yeast, or 10 kb in fission yeast), while a strong oscillation is observed for ORI-proximal genes. 



far higher precision than spotted cDNA microarrays 
[32], any difference would likely underestimate the 
strength of the cell-cycle oscillations (Figure 4). These 
results suggest that the relationship between T rep and 
S-phase transcription in human is substantially stronger 
than the well-established association with asynchronous 
expression. 

Discussion 

These results suggest that 1) S-phase transcription is as- 
sociated with DNA replication timing in budding yeast, 
fission yeast, and human; 2) the association is strongest 
for genomic regions near ORIs, excluding the causal 



model in which T rep affects transcription [8,9,25,26]; 3) 
it is also strongest for regions replicated in late S phase, 
implying that early-firing ORIs are not affected by this 
relationship; and 4) this association explains at least 
three times more of the variability in T rep than the well- 
known association with (asynchronous) gene expression 
in human. 

Although the replication of these patterns across three 
species (and across multiple data sets within species; 
Additional file 1: Figures SI and S2) lends confidence to 
their robustness, several caveats should be considered. 
First, gene expression was represented by transcript abun- 
dances, which is a function of both transcription and 
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Figure 4 Transcription and T rep in human. (A) No oscillation is 
observed when comparing the T rep versus expression levels of all 
cell cycle-regulated genes in HeLa cells (all |r| > 0.063 are significant 
at P< 0.05; the four time-points that exceed this are within the 
range expected by chance, given that 47 time-points were 
analyzed). (B) Significant oscillation is observed when comparing 
T rep versus expression levels of cell cycle-regulated genes with late 
T rep (red line; the final 50% of S phase; all |r| > 0.195 are significant), 
but not early T rep (blue line). (C) Significant oscillation is observed 
when comparing T rep versus expression levels of cell cycle-regulated 
genes within 10 kb of an ORI (blue line; all |r| > 0.1 97 are significant), 
but not further than 10 kb from an ORI (red line). 



transcription have been measured throughout the cell 
cycle. Second, data quality is critical in any analysis; poor- 
quality data can reduce, or entirely mask, a real relation- 
ship. However, in most analyses reported here this is not a 
major concern, because it could only make the current re- 
sults conservative (one exception to this is the ORI- 
proximal versus distal analyses (Figures 3C and 4C): if T rep 
was measured more accurately near ORIs, this would lead 
to stronger ORI-proximal correlations, although add- 
itional analysis suggests this is not the case (see Materials 
and methods)). Third, correlation does not imply caus- 
ation. Although the evidence does not support a model 
where T rep affects transcription (Figures 3C and 4C), I 
cannot determine whether transcription itself is affecting 
T rep , or whether unobserved (latent) factors may be in- 
volved. With this caveat in mind, I believe there is still suf- 
ficient evidence to propose a testable model to account 
for these data. 

A plausible mechanism explaining these observa- 
tions draws from the finding that the firing of ORIs in 
late S phase is governed by recruitment of limiting replica- 
tion initiation factors [12,17-20]. These factors are seques- 
tered by early-firing ORIs from Gl until early S phase, 
and are reused at late-firing ORIs after their release from 
early-firing ORIs. I propose that the level of S -phase tran- 
scription near a late-firing ORI reflects local chromatin ac- 
cessibility and/or subnuclear positioning, and in turn 
the ability of ORIs to recruit these limiting factors dur- 
ing S phase (Figure 5). This model accounts for the re- 
lationship of T rep with S -phase transcription (and the 
differing relationships in other phases); for the relation- 
ship being strongest near late-firing ORIs; and for the 
inferred direction of causality (that is, T rep not being 
causal). 

The proposed mechanism likely acts in concert with 
other factors determining T rep , and thus is not incon- 
sistent with evidence supporting these other factors. 
For example, although the determination of early versus 
late-firing ORIs is completed during M/Gl [12-14], 
S -phase transcription may still influence firing time 
specifically at late-firing ORIs (Figure 5). 

Conclusions 

Future work integrating these results with other (non-mu- 
tually-exclusive) mechanisms affecting T rep - for example, 
Forkhead transcription factors [33] and subnuclear posi- 
tioning [8,29,34,35] - may lead to a unified framework for 
understanding the causes, and consequences, of the tem- 
poral program of DNA replication across eukaryotes. 



mRNA decay; therefore, the correlations reported here 
may underestimate the relationship between transcription 
and T rep . This prediction can be tested once rates of species [2,4,6], and mapped onto genes by linear 



Materials and methods 

Data sources 

Genome-wide T rep values were downloaded for all three 
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Figure 5 A model to explain these observations. Components: ORC and MCM2-7 are protein complexes comprising the pre-replicative complex. 
Blue cylinders represent nucleosomes, with dark blue indicating closed/repressive chromatin and light blue indicating open/accessible chromatin. Red 
proteins are limiting replication initiation factors (such as Cdc45 and Sld3). Txn = transcription. Sequence of events: in G1 (not depicted), the limiting 
replication initiation factors (red circles) associate with the earliest-firing ORIs (top row). When S phase begins, these early ORIs fire and release the 
factors, which are then free to associate with other ORIs (though note that Cdc45 is a component of the replication fork, so can only be recycled after 
fork termination). The relative affinities of the remaining ORIs for these factors - and thus their relative firing times - are determined by the chromatin 
state near the ORI during S-phase. ORIs near genes highly transcribed in S phase (middle row) have an accessible chromatin structure and thus high 
affinity, so will tend to fire earlier than those with little nearby S-phase transcription and thus less accessible chromatin (bottom row). Although not 
shown here, subnuclear positioning could help determine ORI accessibility, either by influencing chromatin structure or through other mechanisms. 
Figure adapted from [19]. 



interpolation to the genes midpoint. Asynchronous yeast 
expression levels (used in Figure 2) were taken from 
[36,37], using the poly- A data for Sc and the median of 
wild-type replicates for Sp. Asynchronous HeLa RNA- 
seq data were from the ENCODE project [31]. Identities 
of cell cycle-regulated genes, their expression levels, and 
the cell-cycle phase of each expression time-point were 
acquired from [27,28,30]. All cell-cycle expression data 
were measured as mRNA levels relative to asynchronous 
levels of each gene, as opposed to absolute mRNA abun- 
dances that can be measured by RNA-seq; therefore, 
these expression levels represent the relative induction 
or repression of each gene throughout the cell cycle. 
The order of maximum expression levels was obtained 
from [38] for Sp and [27] for Sc. ORI locations were 
downloaded from ORIdb [39] for both yeasts (using only 
confirmed' or 'likely ORIs), and from [6] for human (see 
below). 

Yeast data analysis 

All correlations were Pearsons (significance cutoffs given 
in each figure legend). T rep moving averages (Figure 1C) 
were calculated for windows of 100 genes for Sc and 60 
genes for Sp (due to the smaller number of cell cycle- 
regulated genes in Sp). For Figures 1A and 3 A, the G2 
expression data were represented by the 42 minute 
time-point for Sc and 135 minute time-point for Sp; for 
Figure 1A, Sc M/Gl was represented by the 70 minute 



time-point, and Sp Gl was represented by the 225 minute 
time-point. For Figure 3B, the early/late S-phase cutoff 
was chosen at halfway through S phase of each T rep data 
set (39.6 minutes after release from hydroxyurea arrest in 
Sp, and 26.8 minutes after release from cdc7 arrest in Sc). 
The cutoff for ORI-proximal versus ORI-distal (5 kb from 
each genes 5' end in Sc and 10 kb in Sp) was chosen in 
each yeast to result in gene lists of approximately equal 
size. 

P-values in Figure 2 were calculated with a two-tailed 
Students £-test. Because the Sc expression levels were 
calculated as a ratio of mRNA/genomic DNA from asyn- 
chronous cells [37], they represent the number of 
mRNAs per DNA copy, and thus account for the fact 
that genes with early T rep spend a greater portion of the 
cell cycle with two copies. Although the Sp expression 
data [36] do not account for this, correcting for the ef- 
fect by subtracting a fraction of each expression level 
proportional to the time each gene spends with two cop- 
ies had only a minimal effect. 

All code and data are available at [40]. 

Human data analysis 

Human ORIs were defined as Orel binding sites [6] lo- 
cated within 1 mb of early-replicating peaks in the 
HeLa T rep profile, which indicate active ORIs (this win- 
dow size was necessitated by the low resolution of the 
T rep profile) [6]. The early/late T rep cutoff was the first 
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50% of S phase and the ORI-proximal/distal cutoff was 
10 kb from each gene s 5' end. Due to the higher num- 
ber of expression data points per cell cycle in human 
(approximately 15 in human versus approximately 9 for 
both yeasts), a two-point moving average was used for 
plotting human correlation coefficients. 

To compare asynchronous expression versus S -phase 
transcription in HeLa cells, I compared high-coverage 
RNA-seq data from HeLa cells [31] with T rep [6] for the 
same genes analyzed in Figure 4B,C. The fraction of vari- 
ance in T rep explained by the expression data is simply the 
r 2 value from the Pearsons correlation. Comparing these 
values for the asynchronous data with the strongest G2- 
phase (used to represent S -phase transcription, as de- 
scribed above) correlations, among the late-replicating 
genes (represented by the red line in Figure 4B) 2.7% of 
the variance in T rep was explained by the asynchronous 
data, versus 8.1% for S -phase transcription. Likewise 
for ORI-proximal genes (represented by the blue line in 
Figure 4C), the asynchronous data explained 2.3% of the 
variance in T rep , versus 7.6% for S -phase transcription. 

To determine whether T rep is measured with greater 
accuracy near ORIs, I compared the T rep data used in 
Figure 4 [6] with an independent T rep data set from 
HeLa cells [41]. Restricting the analysis to the cell cycle- 
regulated genes analyzed in Figure 4C, I found that ORI- 
distal genes actually showed better agreement between 
T rep data sets than did ORI-proximal genes (r = 0.59 and 
0.46, respectively). This implies that, if anything, T rep is 
measured less accurately in ORI-proximal regions, which 
would lead to an underestimate of the strength of the 
oscillating correlation (blue line in Figure 4C). 

Additional file 
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